Augmentation modification based on user interaction with augmented reality scene

ABSTRACT

Apparatuses, methods, and storage media for modifying augmented reality in response to user interaction are described. In one instance, the apparatus for modifying augmented reality may include a processor, a scene capture camera coupled with the processor to capture a physical scene, and an augmentation management module to be operated by the processor. The augmentation management module may obtain and analyze the physical scene, generate one or more virtual articles to augment a rendering of the physical scene based on a result of the analysis, track user interaction with the rendered augmented scene, and modify or complement the virtual articles in response to the tracked user interaction. Other embodiments may be described and claimed.

TECHNICAL FIELD

The present disclosure relates to the field of augmented reality, and in particular, to modifying augmented reality in response to user interaction.

BACKGROUND

The background description provided herein is for the purpose of generally presenting the context of the disclosure. Unless otherwise indicated herein, the materials described in this section are not prior art to the claims in this application and are not admitted to be prior art by inclusion in this section.

Throughout a person's lifetime, the person may relate to both a physical world in which the person lives and a virtual world, in which the person may play, learn, work, etc. Augmented reality and mixed reality applications create environments that may blur the limits between virtual and real worlds. Under the current state of the art, however, augmented reality may include augmentations of a physical scene with virtual artefacts, which may not be responsive to interactions with the user based on the user's natural interaction modalities, such as facial expressions, gestures, voice commands, and the like.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments will be readily understood by the following detailed description in conjunction with the accompanying drawings. To facilitate this description, like reference numerals designate like structural elements. Embodiments are illustrated by way of example, and not by way of limitation, in the Figures of the accompanying drawings.

FIG. 1 is a block diagram illustrating an example apparatus for modifying augmented reality in response to user interaction, in accordance with various embodiments of the present disclosure.

FIG. 2 illustrates an example computing environment in accordance with various embodiments of the present disclosure.

FIG. 3 illustrates an example process for modifying augmented reality in response to user interaction, in accordance with some embodiments.

FIG. 4 illustrates an example computing environment suitable for practicing various aspects of the disclosure, in accordance with various embodiments.

FIG. 5 illustrates an example non-transitory computer-readable storage medium having instructions configured to practice all or selected ones of the operations associated with the processes described in reference to FIGS. 1-3.

DETAILED DESCRIPTION

In the following detailed description, reference is made to the accompanying drawings which form a part hereof wherein like numerals designate like parts throughout, and in which is shown by way of illustration embodiments that may be practiced. It is to be understood that other embodiments may be utilized and structural or logical changes may be made without departing from the scope of the present disclosure. Therefore, the following detailed description is not to be taken in a limiting sense, and the scope of embodiments is defined by the appended claims and their equivalents.

Computing apparatuses, methods and storage media for modifying augmented reality based on user interaction with rendered augmented reality are described herein. In one instance, the apparatus for modifying augmented reality may include a processor, a scene capture camera coupled with the processor to capture a physical scene, and an augmentation management module to be operated by the processor. The augmentation management module may obtain and analyze the physical scene, generate one or more virtual articles to augment a rendering of the physical scene based on a result of the analysis, track user interaction with the rendered augmented scene, and modify or complement the virtual articles in response to the tracked user interaction.

Various operations may be described as multiple discrete actions or operations in turn, in a manner that is most helpful in understanding the claimed subject matter. However, the order of description should not be construed as to imply that these operations are necessarily order dependent. In particular, these operations may not be performed in the order of presentation. Operations described may be performed in a different order than the described embodiment. Various additional operations may be performed and/or described operations may be omitted in additional embodiments.

For the purposes of the present disclosure, the phrase “A and/or B” means (A), (B), (A) or (B), or (A and B). For the purposes of the present disclosure, the phrase “A, B, and/or C” means (A), (B), (C), (A and B), (A and C), (B and C), or (A, B, and C).

The description may use the phrases “in an embodiment,” or “in embodiments,” which may each refer to one or more of the same or different embodiments. Furthermore, the terms “comprising,” “including,” “having,” and the like, as used with respect to embodiments of the present disclosure, are synonymous.

As used herein, the term “logic” and “module” may refer to, be part of, or include an application specific integrated circuit (ASIC), an electronic circuit, a processor (shared, dedicated, or group), and/or memory (shared, dedicated, or group) that execute one or more software or firmware programs, a combinational logic circuit, and/or other suitable components that provide the described functionality.

FIG. 1 is a block diagram illustrating an example apparatus 100 for modifying augmented reality in response to user interaction, in accordance with various embodiments. As illustrated, the apparatus 100 may include a processor 112, a memory 114, an augmentation management module 140, display 134, model library 138, a scene capture camera 102, and an interaction capture camera 104, communicatively coupled with each other. The apparatus 100 may be, for example, a laptop computing device, a tablet computing device, a mobile computing device, or all-in-one (AIO) computing device.

Scene capture camera 102 may be disposed in or coupled with the apparatus 100 to capture images/video of a physical scene. The physical scene may include images of one or more physical objects. In embodiments, the images captured by camera 102 may include both color information and depth information. Interaction capture camera 104 may be disposed in or coupled with the apparatus 100 to capture images/video of user interaction with an augmented physical scene rendered to the user, as will be described in greater detail in FIG. 2.

Cameras 102 and 104 may be peripherally attached or integrated into apparatus 100. Cameras 102 and 104 may be communicatively coupled with apparatus 100 via a wired or wireless connection suitable for transmitting data captured by cameras 102 and 104. As indicated above, cameras 102 and 104 may be configured to capture both depth and color information. For example, in some embodiments, cameras 102 and 104 may incorporate a depth sensor, such as, an infrared emitter utilized in combination with an infrared camera, as well as a two-dimensional (2D) image capture sensor, such as a red, green, and blue (RGB) camera sensor. Generally, cameras 102 and 104 may have 2D or three-dimensional (3D) image capture capabilities and may be embodied as a 3D camera, depth camera, or bifocal camera, and/or be otherwise capable of generating a depth image, channel, or stream. Cameras 102, 104 may include a still camera, a video camera, a webcam, an infrared (IR) camera, or other device capable of capturing video and/or images. At least interaction capture camera 104 may include a user interface (e.g., microphone) for voice commands applied to augmented reality rendered to a user.

Apparatus 100 may be configured to receive the captured images of physical scene and user interaction with the scene from cameras 102 and 104 and provide the captured images to augmentation management module 140.

The augmentation management module 140 may obtain and analyze the physical scene from scene capture camera 102 and generate one or more virtual articles to augment a rendering of the physical scene, based on a result of the analysis. The augmentation management module 140 may also track user interaction with the rendered augmented scene based on images provided by interaction capture camera 104, and modify or complement the generated virtual articles, in response to the tracked user interaction. The operation of the augmentation management module 140 is described below in greater detail.

The augmentation management module 140 may include one or more components (not shown) configured to merge the color information and depth information included with the images to create 3D renderings of the images provided by cameras 102 and 104. The augmentation management module 140 may further include a scene augmentation component 110 configured to analyze the physical scene images provided by scene capture camera 102 and generate one or more virtual articles to augment a rendering of the physical scene, based on a result of the analysis. The rendering of the physical scene on display 134 may be provided, for example, by the augmentation management module 140 or by a respective management component of an operating system of the apparatus 100.

The scene augmentation component 110 may include an object recognition and analysis module 122 to recognize predefined physical objects contained in the images. This may be accomplished, for example, through a feature extraction and comparison of the extracted features with features of predefined physical objects. In embodiments, the predefined physical objects may be contained in a database of predefined physical objects.

In some embodiments, the scene augmentation component 110 may be configured to enable a user of apparatus 100 to define new physical objects. This may be accomplished by capturing one or more images of the physical object via camera 102. The object recognition and analysis component 122 may be configured to extract features associated with the physical object from the one or more images and generate feature data sufficient to identify the physical object in an image of a physical scene. These features may be stored in a physical object repository (e.g., memory 114) to be utilized for identifying the physical object in future physical scenes.

Some of the recognized objects may include markers, stickers, or other indications of virtual articles that may be relevant for augmentation of the recognized objects in the physical scene. The object recognition and analysis component 122 may be configured to generate one or more virtual articles and augment the 3D rendering of the recognized physical object with one or more virtual articles for output on display 134. For example, the object recognition and analysis module 122 may be configured to recognize key colors or other identifiers in markers associated with the object, to trigger a generation of one or more virtual articles in association with the object, in response to the recognition of the color or other identifier. For example, if a physical object comprises a tree, it may be augmented with the addition of one or more virtual birds, squirrels, nuts, fruit, etc., e.g., according to identifiers associated with the tree and identified by the object recognition and analysis module 122.

In some embodiments, the scene augmentation component 110 may be configured to dynamically track the location of the physical objects in the rendered scene to determine movement of the physical objects and cause the generated virtual articles to behave in accordance with the movement. This may be considered a type of context based augmentation, where the context may be based on the movement of the physical objects.

The described physical scene augmentation techniques are provided for purposes of illustration and should not be viewed as limiting this disclosure. It will be appreciated that different methods of reality augmentation may be applied to a physical scene captured by scene capture camera 102.

In embodiments, the augmentation management module 140 may be configured to track user interaction with the augmented physical scene rendered on display 134 in real- or near-real time, based on images of user interaction captured and provided by interaction capture camera 104 (in some embodiments, in conjunction with user interaction information provided by sensors that may be distributed around the apparatus 100 for user interaction tracking purposes as described below).

User interaction with the augmented rendered scene may take different forms. For example, indications of user interaction may include, but may not be limited to, a gesture, a change of a user's facial expression, a verbal command issued by a user, a change in the user's eye gaze, a change of the user's posture, a change of the user's head pose, or a combination thereof. User interaction may include various types of interaction with one or more virtual articles provided by the scene augmentation component 110 to the augmented scene rendered on display 134.

The interaction capture camera 104 may be configured to capture user interaction with the rendered augmented scene and provide the captured information to the user interaction tracking component 120. In embodiments, the interaction capture camera 104 may be placed in or around the apparatus 100 to face the user of the apparatus 100, in order to capture a facial expression of the user, gestures, and/or changes in posture, pose, eye gaze, and the like.

In some embodiments, apparatus 100, in addition or in the alternative to interaction capture camera 104, may include a plurality of sensors 136 to track indications of user interactions with the rendered augmented scene. The sensors 136 may include proximity sensors, inertial sensors, optical sensors, light sensors, audio sensors, temperature sensors, thermistors, motion sensors, vibration sensors, microphones, cameras, and/or other types of sensors.

The sensors 136 may be distributed across the apparatus 100 in a number of different ways. For example, some sensors (e.g., a microphone to capture audio associated with a user voice command) may reside in the interaction capture camera 104, while others may be embedded around the body of the apparatus 100. Some sensors, such as motion sensors (e.g., accelerometers, gyroscopes, and the like) may be placed in or around the object of the physical scene captured by the scene capture camera 102 to detect position and speed change associated with the object, and the like.

The sensors 136 may include a recording device to record content associated with user interaction, such as images of user interaction or voice commands, for example. The recording device may be embodied as any external peripheral or integrated device.

The augmentation management module 140 may further include a user interaction tracking component 120 configured to track, in real- or near-real time, user interaction with the augmented physical scene, based on user interaction information provided by interaction capture camera 104 and/or sensors 136.

The user interaction tracking component 120 may include a processing component 150 configured to receive, pre-process (e.g., digitize and timestamp) data provided by the interaction capture camera 104 and/or sensors 136 and provide the pre-processed data for further processing described below.

The user interaction tracking component 120 may include a voice recognition component 124 configured to recognize voice commands provided by the user in association with particular virtual articles in the rendered augmented scene. In embodiments, the voice recognition component 124 may include a converter to match voices for multiple users of apparatus 100, who may be eligible to provide the voice commands associated with virtual articles.

The user interaction tracking component 120 may include a facial expression tracking component 115 configured to track facial expressions of the user (e.g., mouth or eye movement), detect facial expression changes, record facial expression changes, and interpret the changes in the user's facial expression in relation to particular virtual articles in the rendered augmented scene. For example, the facial expression tracking component 115 may analyze user facial expressions and enable manipulation of the virtual articles in the augmented scene in response to the changes in the facial expression and/or audio narrative provided by the user via voice commands. The facial expression tracking component 115 may also be configured to track the user's gaze and provide eye tracking information regarding changes in the user's gaze with respect to a virtual article.

The user interaction tracking component 120 may further include a gesture tracking component 116 to track gestures provided by the user in relation to particular virtual articles in the rendered augmented scene. Gestures, alone or in combination with other indications of user interaction, such as voice commands, may serve as indications to manipulate the virtual articles in the augmented scene in response to the indications.

The augmentation management module 140 may include an augmentation modification component 130 (for illustration purposes shown as part of the scene augmentation component 110) configured to modify or complement the one or more virtual articles, in response to the tracked user interaction. For example, the augmentation modification component 130 may be configured to align a virtual article in the rendered augmented scene with the indicated user interaction with the virtual article in the rendered augmented scene detected by the user interaction tracking component 120.

In another example, the augmentation modification component 130 may be configured to alter the position of a virtual article in the rendered augmented scene in response to the indication of the user interaction with the virtual article provided by the user interaction tracking component 120.

In another example, the augmentation modification component 130 may be configured to alter (e.g., change dimensions, color, etc.) the virtual articles in the rendered augmented scene in response to the indication of the user interaction with the virtual articles in the rendered augmented scene.

To facilitate the augmentation modification, the apparatus 100 may include a model library 138 configured as a repository for augmentation modifications to a virtual article based on detected interaction indications. For example, the model library 138 may include rules configured to determine modifications or complements to a virtual article based on associated indications or user interaction with the virtual article, based on, for example, heuristics. For example, the model library 138 may store an index of gestures, voice commands, or facial expressions with particular properties and corresponding types of modification or complements to an associated virtual article.

It should be appreciated that, in some embodiments, any or all of the illustrated components, such as the cameras 102, 104, and/or the sensors 136, may be separate from and remote to, but communicatively coupled with, the apparatus 100. In general, some or all of the functionalities of the apparatus 100, such as processing power and/or memory capacity, may be used or shared with the augmentation management module 140. Furthermore, at least some components of the augmentation management module 140 (e.g., library 138) may be accessible by (e.g., communicatively coupled with) the apparatus 100, but may not necessarily reside on the apparatus 100. One or more of the components mentioned above may be distributed across the apparatus 100 and/or reside on a cloud computing service to host these components. Additionally, in some embodiments, one or more of the illustrative components may be incorporated in, or otherwise form a portion of, another component. For example, the memory 114, or portions thereof, may be incorporated in the processor 112 in some embodiments. It will be understood that augmentation management module 140 may comprise hardware, software (e.g., stored in memory 114), or a combination thereof.

FIG. 2 illustrates an example computing environment 200 in accordance with various embodiments of the present disclosure. At least some of the components of the computing environment 200 may correspond to the components of apparatus 100 of FIG. 1. Computing environment 200 may include apparatus 100, such as a laptop computing device that may include display 134 and the augmentation management module 140. Computing environment 200 may further include the scene capture camera 102 and interaction capture camera 104 coupled with computing device 100. While depicted herein as being integrated into computing device 100, cameras 102 and 104 may, in some embodiments, be peripherally attached to the computing device 100.

As shown, scene capture camera 102 is facing a physical scene substantially in front of the computing device 100, to enable capturing 204 (indicated with dashed lines) of the physical scene 206. As shown, the physical scene includes an object (cup) 208. Interaction capture camera 104 is facing a user 210, to enable capturing 212 (indicated with dashed lines) of user interaction with an augmented scene 214 rendered on the display 134. The computing device 100 may be placed in a stationary position, e.g., on a surface 202, to enable capture of the physical scene 206 with the object 208, user interaction with the augmented scene 214, and scene 214 augmentation modification described herein.

In operation, the scene capture camera 102 may capture 204 the physical scene 206 with the object 208 and provide the captured scene to the augmentation management module 140 for processing and augmentation as described in reference to FIG. 1. The augmentation management module 140 may analyze the captured scene 206, identify the object 208, and provide augmentation of the captured scene 214. For example, the augmentation management module 140 may generate one or more relevant virtual articles to augment the recognized object 208. In the illustrated example of FIG. 2, object 208 may be augmented with virtual articles, such as a flower 216 and butterfly 218, as shown in FIG. 2.

Substantially simultaneously with the scene capture camera 102 operation, the interaction capture camera 104 may capture 212 user interaction with the augmented scene 214 rendered on the display 134, and provide the captured image to the augmentation management module 140 for processing and analysis as described in reference to FIG. 1. As described in reference to FIG. 1, the user interaction may be captured by a plurality of sensors 136 (not shown in FIG. 2), in addition to the interaction capture camera 104. However, it will be assumed that the camera 104 is operating in concert with the sensors to capture user interaction and the interaction capture will be described in relation to the interaction capture camera 104, for simplicity of description.

For example, the interaction capture camera 104 may capture an image of the user 210 and provide the captured image to the augmentation management module 140 for analysis. The analysis may include identification and gathering information about user 210's personal modalities, such as posture, position relative to the camera 104, head pose, facial expression, eye gaze, hand gestures, and the like. Then, camera 104 may continuously or periodically capture, in real- or near-real time, user personal modalities and provide the captured information to the augmentation management module 140, for tracking and detecting indications of user interaction with the augmented rendered scene 214.

As mentioned above, the user 210 may interact with augmented rendered scene 214 in a variety of ways. For example, the user 210 may use hand gestures, voice commands, facial expressions, eye movement, or combinations of voice commands, gestures, and facial expressions.

For example, a user gesture may be associated with a voice command (e.g., via the recording device described in reference to FIG. 1). The user 210 may point at an object in the scene 214 and provide an audio command that a particular type of action be taken with regard to a particular virtual article.

In the example depicted in FIG. 2, the user 210 may manipulate virtual article 218 (butterfly), such as attempt to catch the butterfly with her hands 220, or attempt to move the butterfly with the movement of her eyes 222.

In response to the detected user interaction (e.g., attempt to catch the butterfly), the augmentation management module 140 may modify or complement the virtual articles 216, 218 in the rendered augmented scene 214. For example, the augmentation management module 140 may align the virtual article (e.g., butterfly 218) in the rendered augmented scene 214 with the indicated user interaction, such as attempt to catch the butterfly 218. In other words, the augmentation management module 140 may cause the butterfly 218 to virtually move towards, e.g., “into” the user's hands 220 in the rendered augmented scene 214.

In another example, the augmentation management module 140 may alter the position of the butterfly 218 in response to the user's hands 220's attempt to catch it, such as virtually move the butterfly 218 “away” from the user's hands 220 in the scene 214.

In another example, the augmentation management module 140 may alter the virtual article in response to the detected indication of the user interaction with the virtual article. For example, the augmentation management module 140 may cause the flower 216 to virtually blossom in response to the user's hands 220 waiving or other indication of the user interaction.

In another example, the augmentation management module 140 may align the virtual article in response to the user 210's change of a view point (e.g., via changing head pose).

The augmented scene 214 with the described manipulations of the virtual articles may be re-rendered to the user 210 on the display 134.

FIG. 3 illustrates an example process for modifying augmented reality in response to user interaction, in accordance with some embodiments. The process 300 may be performed, for example, by the apparatus 100 (e.g., computing device) configured with augmentation management module 140 described in reference to FIGS. 1 and 2.

The process 300 may begin at block 302, and include obtaining and analyzing information about the physical scene and about user characteristics related to user interaction with augmented reality. As described in reference to FIG. 2, the scene capture camera may capture a physical scene and provide the captured scene to the augmentation management module for processing and augmentation. The augmentation management module may analyze the captured scene and identify objects in the scene, to provide augmentation of the captured scene.

Substantially simultaneously, the interaction capture camera may capture an image of the user of the computing device and provide the captured image to the augmentation management module for analysis. The analysis may include identification and gathering information about the user's personal modalities, such as posture, position relative to the camera 104, head pose, facial expression, eye gaze, hand gestures, and the like.

At block 304, the process 300 may include generating one or more virtual articles to augment a rendering of the physical scene that may be displayed to the user, based on the analysis of the physical scene. More specifically, one or more relevant virtual articles may be generated to augment the recognized object in the rendered augmented scene.

At block 306, the process 300 may include augmenting the rendering of a physical scene with the virtual articles that were generated at block 304.

At block 308, the process 300 may include tracking user interaction with the rendered augmented scene. For example, the interaction capture camera may continuously or periodically capture, in real- or near-real time, user personal modalities and provide the captured information to the augmentation management module, for tracking and detecting indications of user interaction with the augmented rendered scene.

At block 310, the process 300 may include modifying or complementing one or more articles in the rendered augmented scene in response to the tracking of the user interaction. For example, if an indication of user interaction with a virtual article in the rendered augmented scene is detected, the augmented scene may be modified by aligning the virtual article in the rendered augmented scene with the indicated user interaction in the rendered augmented scene. In another example, the augmented scene may be modified by altering a position of the virtual article in the rendered augmented scene in response to the detected indication of the user interaction. In another example, the augmented scene may be modified by altering the virtual article in the rendered augmented scene in response to the indication of the user interaction.

At decision block 312, the process 300 may include a determination of whether the user session with the computing device has ended. If the session has not ended, the process 300 may return to block 308. Otherwise, the process 300 may end.

It should be understood that the actions described in reference to FIG. 3 may not necessarily occur in the described sequence. For example, actions corresponding to block 308 may take place substantially concurrently with actions corresponding to block 310.

FIG. 4 illustrates an example computing device 400 suitable for use to practice aspects of the present disclosure, in accordance with various embodiments. As shown, computing device 400 may include one or more processors or processor cores 402, and system memory 404. For the purpose of this application, including the claims, the terms “processor” and “processor cores” may be considered synonymous, unless the context clearly requires otherwise. The processor 402 may include any type of processors, such as a central processing unit (CPU), a microprocessor, and the like. The processor 402 may be implemented as an integrated circuit having multi-cores, e.g., a multi-core microprocessor. The computing device 400 may include mass storage devices 406 (such as diskette, hard drive, volatile memory (e.g., DRAM), compact disc read only memory (CD-ROM), digital versatile disk (DVD) and so forth). In general, system memory 404 and/or mass storage devices 406 may be temporal and/or persistent storage of any type, including, but not limited to, volatile and non-volatile memory, optical, magnetic, and/or solid state mass storage, and so forth. Volatile memory may include, but not be limited to, static and/or dynamic random access memory. Non-volatile memory may include, but not be limited to, electrically erasable programmable read only memory, phase change memory, resistive memory, and so forth.

The computing device 400 may further include input/output (I/O) devices 408 (such as a display 134), keyboard, cursor control, remote control, gaming controller, image capture device, and so forth and communication interfaces (comm. INTF) 410 (such as network interface cards, modems, infrared receivers, radio receivers (e.g., Bluetooth), and so forth. I/O devices 408 may further include cameras 102 and 104 and sensors 136, as shown.

The communication interfaces 410 may include communication chips (not shown) that may be configured to operate the device 400 (or 100) in accordance with a Global System for Mobile Communication (GSM), General Packet Radio Service (GPRS), Universal Mobile Telecommunications System (UMTS), High Speed Packet Access (HSPA), Evolved HSPA (E-HSPA), or LTE network. The communication chips may also be configured to operate in accordance with Enhanced Data for GSM Evolution (EDGE), GSM EDGE Radio Access Network (GERAN), Universal Terrestrial Radio Access Network (UTRAN), or Evolved UTRAN (E-UTRAN). The communication chips may be configured to operate in accordance with Code Division Multiple Access (CDMA), Time Division Multiple Access (TDMA), Digital Enhanced Cordless Telecommunications (DECT), Evolution-Data Optimized (EV-DO), derivatives thereof, as well as any other wireless protocols that are designated as 3G, 4G, 5G, and beyond. The communication interfaces 410 may operate in accordance with other wireless protocols in other embodiments.

The above-described computing device 400 elements may be coupled to each other via system bus 412, which may represent one or more buses. In the case of multiple buses, they may be bridged by one or more bus bridges (not shown). Each of these elements may perform its conventional functions known in the art. In particular, system memory 404 and mass storage devices 406 may be employed to store a working copy and a permanent copy of the programming instructions implementing the operations associated with apparatus 100, e.g., operations associated with providing augmentation management module 140 as described in reference to FIGS. 1-3, generally shown as computational logic 422. Computational logic 422 may be implemented by assembler instructions supported by processor(s) 402 or high-level languages that may be compiled into such instructions.

The permanent copy of the programming instructions may be placed into mass storage devices 406 in the factory, or in the field, through, for example, a distribution medium (not shown), such as a compact disc (CD), or through communication interfaces 410 (from a distribution server (not shown)).

FIG. 5 illustrates an example non-transitory computer-readable storage media 502 having instructions configured to practice all or selected ones of the operations associated with the processes described above. As illustrated, non-transitory computer-readable storage medium 502 may include a number of programming instructions 504 (e.g., including augmentation management module 140). Programming instructions 504 may be configured to enable a device, e.g., computing device 400, in response to execution of the programming instructions, to perform one or more operations of the processes described in reference to FIGS. 1-3. In alternate embodiments, programming instructions 504 may be disposed on multiple non-transitory computer-readable storage media 502 instead. In still other embodiments, programming instructions 504 may be encoded in transitory computer-readable signals.

Referring again to FIG. 4, the number, capability, and/or capacity of the elements 408, 410, 412 may vary, depending on whether computing device 400 is used as a stationary computing device, such as a set-top box or desktop computer, or a mobile computing device, such as a tablet computing device, laptop computer, game console, or smartphone. Their constitutions are otherwise known, and accordingly will not be further described.

At least one of processors 402 may be packaged together with memory having computational logic 422 configured to practice aspects of embodiments described in reference to FIGS. 1-4. For example, computational logic 422 may be configured to include or access content augmentation module 140, such as component 120 described in reference to FIG. 1. For one embodiment, at least one of the processors 402 may be packaged together with memory having computational logic 422 configured to practice aspects of process 300 of FIG. 3 to form a System in Package (SiP) or a System on Chip (SoC).

In various implementations, the computing device 500 may comprise a laptop, a netbook, a notebook, an ultrabook, a smartphone, a tablet, a personal digital assistant (PDA), an ultra mobile PC, a mobile phone, a desktop computer, a server, a printer, a scanner, a monitor, a set-top box, an entertainment control unit, a digital camera, a portable music player, or a digital video recorder. In further implementations, the computing device 400 may be any other electronic device that processes data.

The following paragraphs describe examples of various embodiments.

Example 1 is an apparatus for providing augmented reality computing, comprising: a processor; a scene capture camera coupled with the processor to capture a physical scene; and an augmentation management module to be operated by the processor to: obtain and analyze the physical scene, generate one or more virtual articles to augment a rendering of the physical scene, based on a result of the analysis, rack user interaction with the rendered augmented scene, and modify or complement the one or more virtual articles, in response to the tracked user interaction.

Example 2 may include the subject matter of Example 1, further comprising: an interaction capture camera coupled with the processor to capture the user interaction with the augmented scene and to provide information about the captured interaction to the augmentation management module for tracking.

Example 3 may include the subject matter of Example 2, wherein the apparatus is selected from one of: a laptop computing device, a tablet computing device, a mobile computing device, or an all-in-one (AIO) computing device.

Example 4 may include the subject matter of Example 1, wherein the augmentation management module to track user interaction with the rendered augmented scene includes to obtain an indication of user interaction with at least one of the one or more virtual articles in the rendered augmented scene.

Example 5 may include the subject matter of Example 4, wherein the augmentation management module to modify or complement the rendered augmented scene includes to: align the at least one of the one or more virtual articles in the rendered augmented scene with the indicated user interaction with the at least one of the one or more virtual articles in the rendered augmented scene; alter a position of the at least one of the one or more virtual articles in the rendered augmented scene in response to the indication of the user interaction with the at least one of the one or more virtual articles in the rendered augmented scene; or alter the at least one of the one or more virtual articles in the rendered augmented scene in response to the indication of the user interaction with the at least one of the one or more virtual articles in the rendered augmented scene.

Example 6 may include the subject matter of Example 4, wherein the indication of user interaction includes at least a selected one of: a gesture, a change of facial expression, a verbal command, a change in eye gaze, a change of posture, or a change of a head pose.

Example 7 may include the subject matter of Example 1, further comprising: a display device coupled with the processor to display the rendered augmented scene to the user.

Example 8 may include the subject matter of Example 1, wherein the augmentation management module to augment a rendering of the physical scene with one or more virtual articles includes to augment the rendering based at least in part on one or more markers associated with the physical scene.

Example 9 may include the subject matter of any of Examples 1 to 8, wherein the augmentation management module is to modify or complement the augmented scene substantially simultaneously with the tracking of the user interaction.

Example 10 may include the subject matter of Example 2, wherein each of the cameras comprises a two-dimensional (2D) or three-dimensional (3D) camera to capture a real-time depth data and color data associated with the physical scene or the user interaction respectively, wherein the color data includes red, green, and blue (RGB) data.

Example 11 may include the subject matter of Example 10, wherein the apparatus is placed on a substantially horizontal surface, to enable the capture of the physical scene and the user interaction.

Example 12 is a computer-implemented method for providing augmented reality computing, comprising: augmenting, by a computing device, a rendering of a physical scene with one or more virtual articles; tracking, by the computing device, user interaction with the rendered augmented scene, and modifying or complementing, by the computing device, the one or more virtual articles in the rendered augmented scene in response to the tracking of the user interaction.

Example 13 may include the subject matter of Example 12, further comprising: obtaining and analyzing, by the computing device, the physical scene; and generating, by the computing device, the one or more virtual articles to augment the rendering of the physical scene, based on the analyzing the physical scene.

Example 14 may include the subject matter of any of Examples 12 to 13, further comprising: rendering, by the computing device, the augmented scene for display.

Example 15 may include the subject matter of Example 14, wherein tracking user interaction with the rendered augmented scene includes: obtaining, by the computing device, an indication of user interaction with at least one of the one or more virtual articles in the rendered augmented scene, and wherein modifying or complementing the one or more articles further includes: aligning, by the computing device, the at least one of the one or more virtual articles in the rendered augmented scene with the indicated user interaction substantially simultaneously with and in response to obtaining of indication of the user interaction with the at least one of the one or more virtual articles in the rendered augmented scene.

Example 16 may include the subject matter of Example 15, wherein obtaining an indication of user interaction includes detecting, by the computing device, at least a selected one of: a gesture, a change of facial expression, a verbal command, a change in eye gaze, a change of posture, or a change of a head pose.

Example 17 is one or more computer-readable media having instructions for providing augmented reality computing stored thereon that, in response to execution by a computing device, provide the computing device with an augmentation management environment to: augment a rendering of a physical scene with one or more virtual articles; track user interaction with the rendered augmented scene, and modify or complement the one or more articles in the rendered augmented scene in response to the tracking of the user interaction.

Example 18 may include the subject matter of Example 17, wherein the computing device is further provided with the augmentation management environment to: obtain information about the physical scene; analyze information about the physical scene; and generate the one or more virtual articles to augment the rendering of the physical scene based on a result of the analysis of the physical scene.

Example 19 may include the subject matter of any of Examples 17 to 18, wherein the computing device is provided with the augmentation management environment to track user interaction with the rendered augmented scene includes the augmentation management environment to: obtain an indication of user interaction with at least one of the one or more virtual articles in the rendered augmented scene.

Example 20 may include the subject matter of Example 19, wherein the computing device is provided with the augmentation management environment to modify or complement the one or more articles includes the augmentation management environment to align the at least one of the one or more virtual articles in the scene with the indicated user interaction substantially simultaneously with and in response to the obtaining of the indication of user interaction with the at least one of the one or more virtual articles.

Example 21 is an apparatus for providing augmented reality computing, comprising: means for augmenting a rendering of a physical scene with one or more virtual articles; means for tracking user interaction with the rendered augmented scene, and means for modifying or complementing the one or more articles in the rendered augmented scene in response to the tracking of the user interaction.

Example 22 may include the subject matter of Example 21, further comprising: means for obtaining information about the physical scene; means for analyzing information about the physical scene; and means for generating the one or more virtual articles to augment the rendering of the physical scene based on a result of the analysis of the physical scene.

Example 23 may include the subject matter of any of Examples 21 to 22, wherein means for tracking user interaction with the rendered augmented scene include means for obtaining an indication of user interaction with at least one of the one or more virtual articles in the rendered augmented scene.

Example 24 may include the subject matter of Example 23, wherein means for modifying or complementing the one or more articles in the rendered augmented scene in response to the tracking of the user interaction include means for aligning the at least one of the one or more virtual articles in the scene with the indicated user interaction substantially simultaneously with and in response to the obtaining of the indication of user interaction with the at least one of the one or more virtual articles.

Computer-readable media (including non-transitory computer-readable media), methods, apparatuses, systems, and devices for performing the above-described techniques are illustrative examples of embodiments disclosed herein. Additionally, other devices in the above-described interactions may be configured to perform various disclosed techniques.

Although certain embodiments have been illustrated and described herein for purposes of description, a wide variety of alternate and/or equivalent embodiments or implementations calculated to achieve the same purposes may be substituted for the embodiments shown and described without departing from the scope of the present disclosure. This application is intended to cover any adaptations or variations of the embodiments discussed herein. Therefore, it is manifestly intended that embodiments described herein be limited only by the claims. 

1-20. (canceled)
 21. A computer comprising: a first camera to collect images of a physical scene; at least one sensor to track an eye gaze of a user; at least one screen; at least one microphone to detect a voice command of the user; at least one storage device; instructions; and at least one processor to execute the instructions to: cause display on the at least one screen of at least one virtual object, the at least one virtual object to be overlaid with respect to the physical scene; determine a first interaction with the at least one virtual object based on detection of a combination of (a) the eye gaze of the user, (b) the voice command of the user, and (c) a head pose of the user; and cause a simulation of a manipulation of the at least one virtual object to be displayed on the at least one screen in response to detection of the combination.
 22. The computer as defined in claim 21, wherein the at least one processor is to determine the interaction with the at least one virtual object based on detection of a hand gesture in the physical scene.
 23. The computer as defined in claim 22, wherein the hand gesture includes at least one of a point gesture or a wave gesture.
 24. The computer as defined in claim 21, wherein the at least one processor is to identify physical objects of the physical scene.
 25. The computer as defined in claim 24, wherein the at least one processor is to update a database of locations corresponding to the identified physical objects of the physical scene.
 26. The computer as defined in claim 21, wherein the computer includes at least one of a laptop, a tablet or a mobile phone.
 27. The computer as defined in claim 21, wherein the at least one processor is to cause display of the at least one virtual object based on detection of one or more markers of the physical scene.
 28. At least one non-transitory computer-readable medium comprising machine-readable instructions that, when executed, cause at least one processor to at least: cause tracking of an eye gaze of a user; cause monitoring of a voice command of the user; cause display on at least one screen of at least one virtual object, wherein the at least one virtual object is overlaid with respect to a physical scene; determine a first interaction with the at least one virtual object based on detection of a combination of (a) the eye gaze of the user, (b) the voice command of the user, and (c) a head pose of the user; and cause simulation of a manipulation of the at least one virtual object to be displayed on the at least one screen in response to detection of the combination.
 29. The at least one non-transitory computer-readable medium as defined in claim 28, wherein the instructions, when executed, cause the at least one processor to determine the interaction with the at least one virtual object based on detection of a hand gesture in the physical scene.
 30. The at least one non-transitory computer-readable medium as defined in claim 29, wherein the instructions, when executed, cause the at least one processor to detect the hand gesture as at least one of a point gesture or a wave gesture.
 31. The at least one non-transitory computer-readable medium as defined in claim 28, wherein the instructions, when executed, cause the at least one processor to identify physical objects of the physical scene.
 32. The at least one non-transitory computer-readable medium as defined in claim 31, wherein the instructions, when executed, cause the at least one processor to update a database of locations corresponding to the identified physical objects of the physical scene.
 33. The at least one non-transitory computer-readable medium as defined in claim 28, wherein the instructions, when executed, cause the at least one processor to cause display of the at least one virtual object based on detection of one or more markers of the physical scene.
 34. A mobile computing device comprising: at least one screen; memory; and processor circuitry to: cause tracking of an eye gaze of a user; cause monitoring of a voice command of the user; cause display on the at least one screen of at least one virtual object, the at least one virtual object to be overlaid with respect to a physical scene; determine a first interaction with the at least one virtual object based on detection of a combination of (a) the eye gaze of the user, (b) the voice command of the user, and (c) a head pose of the user; and cause simulation of a manipulation of the at least one virtual object to be displayed on the at least one screen in response to detection of the combination.
 35. The mobile computing device as defined in claim 34, wherein the processor circuitry is to determine the interaction with the at least one virtual object based on detection of a hand gesture in the physical scene, wherein the hand gesture is at least one of a point gesture or a wave gesture.
 36. The mobile computing device as defined in claim 34, wherein the processor circuitry is to identify physical objects of the physical scene.
 37. The mobile computing device as defined in claim 36, wherein the processor circuitry is to update a database of locations corresponding to the identified physical objects of the physical scene.
 38. The mobile computing device as defined in claim 34, wherein the processor circuitry is to cause display of the at least one virtual object based on detection of one or more markers of the physical scene. 